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Abstract: 


The spike (S) protein of the recently emerged human coronavirus (MERS-CoV) mediates 
infection by binding to the cellular receptor dipeptidyl peptidase 4 (DPP4). Here we mapped the 
receptor-binding domain in the S protein to a 231-amino acid fragment (residues 358-588) by 
evaluating the interaction of spike truncation variants with receptor expressing cells and soluble 
DPP4. Antibodies to this domain - much less so to the preceding N-terminal region - efficiently 
neutralize MERS-CoV infection. 

Text 

Just 10 years following the outbreak of the severe respiratory acute syndrome coronavirus 
(SARS-CoV) the world is confronted with yet another deadly human coronavirus. The virus, first 
provisionally called human coronavirus-EMC (hCoV-EMC) (1, 2) but now named MERS-CoV, 
referring to its emergence in the Middle-East and to the respiratory syndrome it causes, 
belongs to the betacoronavirus genus lineage 2c (3). As of June 7 th 2013, 55 cases have been 
laboratory confirmed including 31 deaths, all from - or linked to - the Arabian Peninsula (4). Like 
with SARS-CoV, patients affected by MERS-CoV suffer from severe and often lethal lower 
respiratory tract infection. The epidemiology of MERS-CoV is still enigmatic, but the 
geographical distribution of epidemiologically unlinked individuals points to intermittent, 
zoonotic transmission from a - so far unknown - animal source, whereas a number of reported 
clusters indicate limited human-to-human spread (5). 

The main determinant of coronavirus tropism is the viral spike (S) protein as it mediates 
binding to a cell-surface receptor. The MERS-CoV S protein, a 1353 amino acid type I 
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membrane glycoprotein, assembles into trimers that constitute the spikes or peplomers on the 


surface of the enveloped coronavirus particle. The protein combines the two essential entry 
functions, namely that of host receptor binding and membrane fusion, which are attributed to 
the N-terminal (SI, residues 1-751) and C-terminal (S2, residues 752-1353) half of the S 
protein, respectively (Fig.la). Recently we have identified dipeptidyl peptidase 4 (DPP4, also 
known as CD26), expressed in the human lung, as a functional receptor for MERS-CoV(6). 
Importantly, MERS-CoV can also use the evolutionary conserved DPP4 of other species, most 
notably that of bats(6, 7). 

Coronaviruses bind to receptors via independently folded, generally about 150-300 
residues long receptor binding domains (RBD) present in their SI subunit, of which the location 
within SI can vary (8-10). Thus, for the betacoronavirus mouse hepatitis virus (MHV) the 
binding to its CEACAM receptor (11) has been mapped to the N-terminal -300 amino acids of 
the spike protein (12, 13) whereas for the SARS-CoV - of the same genus - binding to the ACE2 
receptor (14) maps to residues 323-502 of SI (15, 16) (Fig.la). Identification of the RBD can 
hence help the development of monoclonal antibodies and vaccines for the treatment and 
prevention of infection. The RBD is the most important target for neutralizing antibodies (12, 17, 
18) preventing virus-receptor interaction. 

We previously used the SI domain of MERS-CoV fused to the Fc-region of human IgG to 
demonstrate the interaction of SI with DPP4-expressing cells and with soluble, i.e. non 
membrane-anchored DPP4 (6). To identify the receptor binding domain in the MERS-CoV SI 
subunit, we generated Sl-Fc protein chimera's with truncations at the C-terminus and 
N-terminus of the SI domain. We considered a three domain structure of the MERS-CoV SI 
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protein (residues 1-357, 358-588 and 589-747) based on the predicted location and structure of 


the RBD of two other betacoronaviruses, MHV and SARS-CoV (12, 13, 15, 16), of which the 
homologous regions for MERS-CoV S map to the residues 18-351 and 379-580, respectively 
(Fig. 1b). In addition, a soluble form of human DPP4 (residues 39-766) was made, which was 
C-terminally tagged with the Fc region. These proteins were expressed in HEK-293T cells after 
transfection of the respective expression plasmids and subsequently affinity-purified from the 
cell culture supernatant using protein A sepharose beads as described(6). The Fc region of 
purified sDPP4-Fc was proteolytically removed using trypsin (data not shown). First, we 
analyzed the Sl-Fc proteins and C-terminal SI truncations thereof for their ability to interact 
with sDPP4 using a co-purification assay. sDPP4 was efficiently co-purified by the Sl-Fc 
variants encompassing residues 1-588 and 1-747 whereas the 1-357 Sl-Fc variant was unable 
to bind sDPP4 (Fig.2a). We next generated an Sl-Fc variant comprising residues 358-588, a 
region homologous to the ACE2 receptor binding domain in SARS-CoV SI (Fig.2a). This Sl-Fc 
truncation variant efficiently bound soluble DPP4, indicating that the DPP4 receptor binding 
domain is located within the 358-588 residues domain of the MERS-CoV spike protein. 

We subsequently tested the ability of these Sl-Fc variants to bind to HEK-293T cells 
transiently expressing DPP4 by using flow cytometry. The Sl-Fc variants encompassing 
residues 1-588 and 358-588 bound to DPP4-expressing HEK-293T cells with efficiencies 
comparable to the full-length SI protein whereas no binding was observed with the 1-357 
Sl-Fc variant (Fig.2b). These data show the 358-588 amino acids SI region to be essential and 
sufficient for binding to DPP4-expressing cells, consistent with the results of the sDPP4 
interaction study. 
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To confirm the observed interactions in a more biological assay we analyzed the ability of 


the Sl-Fc variants to prevent MERS-CoV infection. Thus, Huh-7 cells were preincubated with 
the different Sl-Fc variants before being inoculated with the MERS-CoV. We found that the 
variants encompassing residues 1-747 and 358-588, but not the 1-357 Sl-Fc variant, inhibited 
infection (Fig.2c). 

Finally polyclonal antibodies were raised in rabbits against the 1-747, 1-357 and 358-588 
Sl-Fc variants (Davids Biotechnology GmbH, Germany). The sera, which displayed equal 
ELISA titers towards its antigen (1:300.000, data not shown), were tested for their ability to 
neutralize virus infectivity. Antibodies raised against the 358-588 Sl-Fc variant efficiently 
neutralized virus infectivity, superior to those raised against the 1-747 and 1-357 Sl-Fc variants 
(Fig.2d). This indicates that neutralizing epitopes within SI are primarily localized to the RBD 
region. The elicited antibodies are likely to block the interaction of the spike protein with DPP4 
thereby neutralizing MERS-CoV infectivity. Of note, antibodies raised against the MERS-CoV-S 
RBD did not cross-neutralize SARS-CoV infection (data not shown). The results demonstrate the 
potential of SI protein and of the 358-588 SI polypeptide as subunit vaccines with a high 
biosafety profile compared to vaccines based on inactivated viruses or live-attenuated virus. 

Except for the betacoronavirus MHV, which binds to its CEACAM receptor through a 
domain in the N-terminal part of its SI protein, the RBDs of all other coronaviruses that engage 
protein receptors and that have been mapped occur in the C-terminal portion of the SI subunit 
(Fig.3). Examples also include the alphacoronaviruses binding to ACE2 (hCoV-NL63) and APN 
(e.g. TGEV, hCoV-229E) (10, 19-25). In this study we have experimentally mapped the RBD of 
MERS-CoV to a 231-amino acid fragment (residues 358-588) within the spike protein. This 
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domain nicely corresponds with the SI region recently anticipated to interact with the DPP4 


receptor on the basis of theoretical SI structure predictions (26). The RBD in the MERS-CoV 
SI protein localizes in the same region where the SARS-CoV S protein interacts with its ACE2 
receptor (26).The SARS-CoV RBD structure displays a 5-stranded [3-sheet core structure (pi-4 
and (37) maintaining the overall domain conformation, and a long extended loop containing two 
anti-parallel [3-sheets ([35 and (36) responsible for receptor binding(16). Intriguingly, compared 
to SARS-CoV, the RBD of MERS-CoV contains a relatively conserved core domain but a highly 
variable loop region, tentatively explaining the differential receptor usage(26). Crystallization 
and structure analysis of this MERS-CoV RBD region in complex with DPP4 will give detailed 
insight into the virus-receptor binding interface. 

Figure legends 

Figure 1 . Receptor binding domains in betacoronavirus spike proteins and Sl-Fc expression 
constructs, a, Schematic representation of the betacoronaviruses SARS-CoV, hCoV-EMC S 
and MHV (strain A59) spike (S) protein sequence (drawn to scale) aligned at the SI-S2 junction. 
The known receptor binding domain in the SI subunit of MHV and SARS-CoV S proteins and 
their corresponding homologous regions in hCoV-EMC S as defined by ClustalW alignment are 
indicated. Positions of the transmembrane domain (yellow bar; predicted by the TMHMM server) 
and of the predicted A/-glycosylation sites (4*; predicted by the NetNGIyc server, only shown for 
the hCoV-EMC S) are indicated. The border between the SI and S2 subunits of the spike 
protein is represented by a vertical white line, b, upper panel, schematic presentation of the 
hCoV-EMC SI subunit (residues 1-751) sequence. Cysteine positions in SI subunit are 
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indicated by vertical white lines with corresponding amino acid positions on top. Positions of 


cysteines highly conserved among betacoronaviruses SI proteins are in bold. Predicted 
disulfide bond connections inferred from the structure of the SARS-CoV receptor binding 
domain are presented as connecting black lines underneath. Lower panel, domains of the 
hCoV-EMC SI subunit expressed as Fc chimera's. 

Figure 2. The DPP4 binding domain is located within residues 358-588 of the hCoV-EMC spike 
protein and efficiently elicits neutralizing antibodies, a, Sl-Fc chimeric proteins and soluble 
DPP4 (sDPP4) receptor were expressed from HEK-293T cells and purified from the culture 
supernatant. Sl-Fc proteinswere mixed with sDPP4 followed by protein A sepharose affinity 
isolation, analyzed on a NoVEX® 4-12% Tris-Glycine gradient gel under non-reducing 
conditions, and stained with GelCodeBlue reagent. Position of the Sl-Fc proteins - running as 
dimers under non-reducing conditions due to an Fc interchain disulphide bond - and sDPP4 as 
well as the sizes of the marker proteins are indicated. Individual proteins were loaded as 
controls, b, Binding of hCoV-EMC Sl-Fc proteins to DPP4 expressing cells. 2.5x10 5 HEK-293T 
cells transfected with control pCAGGS (grey shaded area) or with pCAGGS-DPP4 (black line) 
expression plasmid were incubated with 15 pg/ml of the indicated Sl-Fc followed by incubation 
with DyLight488 labeled goat-anti-human IgG antibody and analysis by flow cytometry. An 
Fc-chimera containing the SI of infectious bronchitis virus (IBV-SI-Fc) was taken along as a 
negative control, c, Inhibition of hCoV-EMC infection by Sl-Fc 1-747, 1-357 and 358-588 
variants. Huh7 cells were preincubated with 40 pg/ml Sl-Fc 1-747, 1-357 or 358-588 for 0.5 h 
prior to virus inoculation (1 h), all at RT. Mock incubated cells (control) and cells incubated with 
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a DPP4 polyclonal antibody (anti-DPP4) were taken along as controls. Following incubation at 


37° C for 8 hours, infected cells were detected by immunofluorescence and infection was 
quantified (relative to control). The experiment was carried out twice and the data of one a 
representative experiment are shown. Error bars indicate standard error of the mean, d, 
Neutralization of hCoV-EMC infection by rabbit antisera raised against the Sl-Fc 1-747, 1-357 
and 358-588 variants. Virus (200 pfu) was premixed 1:1 with serial dilutions of sera obtained 
(open bars) or after immunization (closed bars) prior to inoculation onto VERO cells and virus 
infection was monitored by the occurrence of CPE at 72 hours post infection. Virus 
neutralization titers (VNT) were determined in quadruplicate as the highest serum dilutions that 
completely prevent CPE. The experiment was carried out twice and the data of one a 
representative experiment are shown. Error bars indicate standard error of the mean. 

Figure 3. Localization of receptor-binding domains in coronavirus spike proteins. Schematic 
presentation of the spike proteins of the alphacoronaviruses TGEV and hCoV-NL63 and of the 
betacoronaviruses SARS-CoV, hCoV-EMC and MHV (drawn to scale), aligned at the S1-S2 
junction. Blue boxes represent the receptor-binding domains (RBD) and indicate the engaged 
receptor. The RBD of TGEV, hCoV-NL63, SARS-CoV and MHV have been confirmed by 
crystallography (12, 15, 22, 26). Grey boxes indicate the transmembrane domain. Sequence 
IDs: TGEV (ABG89335.1), hCoV-NL63 (NC_005831.2), SARS-CoV (NP_828851.1), 
hCoV-EMC (AFS88936.1), MHV (NC_001846.1). 
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